Fix diffusion fallback guards and validation #23335

Merged: BBuf merged 17 commits into sgl-project:main from BBuf:codex/diffusion-cleanup-audit on May 6, 2026

BBuf (Collaborator) commented Apr 21, 2026

Summary

  • Fix native diffusion RoPE fallbacks to accept full-width interleaved cos/sin caches, matching the Triton path.
  • Tighten CuTe fused scale/shift validation for [B, F, 1, D] inputs where S is not divisible by F.
  • Add a focused test for the frame divisibility guard.
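
The frame divisibility guard from the last two bullets can be sketched as follows. This is a minimal illustration, assuming the activation is shaped [B, S, D] and the scale/shift tensor is shaped [B, F, 1, D], so that each of the F frames covers S / F tokens. The function name `check_frame_divisibility` and its error messages are hypothetical and are not taken from the PR's code; shapes are passed as plain tuples so the sketch runs without torch.

```python
def check_frame_divisibility(x_shape, scale_shape):
    """Hypothetical guard: validate a [B, F, 1, D] scale/shift against a [B, S, D] input.

    Returns the number of tokens per frame (S // F) when the shapes are compatible.
    """
    if len(x_shape) != 3:
        raise ValueError(f"expected x of shape [B, S, D], got {x_shape}")
    if len(scale_shape) != 4 or scale_shape[2] != 1:
        raise ValueError(f"expected scale/shift of shape [B, F, 1, D], got {scale_shape}")

    batch, seq_len, dim = x_shape
    scale_batch, num_frames, _, scale_dim = scale_shape

    if (scale_batch, scale_dim) != (batch, dim):
        raise ValueError(
            f"batch/hidden mismatch: x is {x_shape}, scale/shift is {scale_shape}"
        )
    # The per-frame broadcast is only well defined when S splits evenly into F frames.
    if num_frames <= 0 or seq_len % num_frames != 0:
        raise ValueError(
            f"sequence length S={seq_len} is not divisible by frame count F={num_frames}"
        )
    return seq_len // num_frames


# Usage: 12 tokens split evenly over 4 frames, so each frame covers 3 tokens.
tokens_per_frame = check_frame_divisibility((2, 12, 64), (2, 4, 1, 64))
```

Rejecting the non-divisible case up front, rather than truncating or padding, keeps the failure visible at validation time instead of inside the kernel.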

Validation

  • python3 -m py_compile python/sglang/jit_kernel/diffusion/triton/torch_fallback.py python/sglang/jit_kernel/diffusion/triton/npu_fallback.py python/sglang/jit_kernel/diffusion/cutedsl/scale_residual_norm_scale_shift.py

Note

  • Dropped the diffusion case parser/dashboard changes from this PR so it no longer changes CI partition/coverage behavior. Parser append handling should be addressed separately once the newly exposed dynamic cases are stable.

BBuf force-pushed the codex/diffusion-cleanup-audit branch from 75c87b5 to bdbcfac on April 21, 2026 07:01
BBuf (Collaborator, Author) commented Apr 21, 2026

/tag-and-rerun-ci

BBuf (Collaborator, Author) commented Apr 25, 2026

/tag-and-rerun-ci

BBuf (Collaborator, Author) commented May 2, 2026

/tag-and-rerun-ci

BBuf (Collaborator, Author) commented May 3, 2026

/tag-and-rerun-ci

BBuf (Collaborator, Author) commented May 4, 2026

/tag-and-rerun-ci

BBuf changed the title from "Fix diffusion CI parsing and fallback guards" to "Fix diffusion fallback guards and validation" on May 6, 2026
BBuf merged commit d86f291 into sgl-project:main on May 6, 2026
97 of 134 checks passed
ltcs11 added a commit to ltcs11/sglang that referenced this pull request on May 7, 2026
LLThomas pushed a commit to LLThomas/sglang that referenced this pull request on May 8, 2026
LucQueen pushed a commit to LucQueen/sglang that referenced this pull request on May 12, 2026